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We study a new ensemble of random correlation matrices related to multivariate Student (or more 
generally elliptic) random variables. We establish the exact density of states of empirical correlation 
matrices that generalizes the Marcenko-Pastur result. The comparison between the theoretical 
density of states in the Student case and empirical financial data is surprisingly good, even if we 
are still able to detect systematic deviations. Finally, we compute explicitely the Kullback-Leibler 
entropies of empirical Student matrices, which are found to be independent of the true correlation 
matrix, as in the Gaussian case. We provide numerically exact values for these Kullback-Leibler 
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I Estimating and analyzing the correlation matrix of N different variables from a data set is very recurrent problem 
i in statistical analysis. Typically, one observes the value of the N different variables over a time period of size T. The 
total number of data point is NT whereas the number of elements of the correlation matrix is N{N — l)/2. In many 
applications, both T and N are large but Q = T/N is of order unity. For example, in financial applications the values 
of T and TV go from few hundreds to a few thousands. In these cases, the correlation (or covariance) matrix is in fact 
rather poorly determined. This means, in particular, substantial noise in the determination of the eigenvalues and 
' eigenvectors. For instance, focus on the case where all random variables are in fact independent, such that the true 
Lf^ [ correlation matrix Cij is the identity matrix. The empirical determination of C, which wc call E, is obtained from 
^ I' the zero mean, unit variance Gaussian variables r* using the Pearson estimator: 

E. = ^t<rj (1) 

<N . . 

, The eigenvalue spectrum p{X) of E only approaches a (5-function at A = 1 when Q ^ oo. When Q is finite, however 
OO ' p{\) is non-trivial. Under mild hypothesis about the distribution of the r\s (essentially that the variance is finite) 
, the spectrum approaches, in the large N,T limit, the Marcenko-Pastur distribution [l|: 



P(A)^ ^^^' itxq '^^ Ae[(l-y^)^(l + V^)^] (g = l/Q<l). (2) 

^ \ For finite N, the edges of the spectrum are smoothed. The statistics of exceptionally large eigenvalues is described by 
• '~j ■ the Tracy- Widom distribution , or by a Frechet distribution when the rt have power-law tails decaying sufficientjv 
/\ ' slowly 0. The probability of large fluctuations of the maximum eigenvalue (for Gaussian rt) has been derived in [3]. 
^ , In this work wc focus on a more general ensemble that often provides a faithful representation of empirical data. 
We will consider that the random variables r* can be written as a product of two terms: r* = cr*?7* where 77* and cr* 
are independent random variables. The 77- arc characterized by a true correlation matrix Cij = {rjirjj) and cr* are such 
that (cr^) = 1. Furthermore, 77* are drawn independently from the same distribution at each time t. A particular case 
on which we shall focus extensively corresponds to Gaussian distributed rji and 



P(a)^^cxp 



(3) 



where CTq — 2/i/(/^ — 2) in such a way that {cr'^) = 1 (r(a;) is the Gamma function). This case corresponds to a 
multivariate Student distribution for the original variables r*. This defines the Student Ensemble we will consider 
more specifically below, although many of our results extend to arbitrary choice of P{cr) and -P({77*}), defining a 
large class of elliptic distributions. We will also focus on applications of our results to finance where the correct 
determination of the true correlation matrix between stocks play a very important role. 

The application of Random Matrix Theory to financial correlation matrices was first suggested in [1, @, 0, ■ In 
this context r* are daily (or higher frequency) returns of N different stocks over a time period of size T. As discussed 
above, even in the absence of any correlation between stocks, we expect the eigenvalues of an empirical correlation 



2 



matrix, determined over a finite time interval, to be non trivial. As a consequence, distinguishing between noise and 
genuine information in the empirical density of states is a subtle matter. For example, the diagonalisation of the 
correlation matrix of, say, 450 stocks computed over a bit more than 4 years (1125 trading days) reveals one very 
large eigenvalue corresponding to a roughly uniform eigenvector, corresponding to the "market mode" , and a handful 
of other large eigenvalues that can be seen to correspond to large sectors of economic activity. Smaller eigenvalues, 
however, form a blob around A = 1. Is this blob well described by the Marcenko-Pastur distribution? In order to 
answer this question one has to rescalc the small (supposedly "pure noise" ) empirical eigenvalues in order to have 
A = 1 and compare to a Marcenko-Pastur distribution (for which one has by definition A = 1). Different results are 
obtained depending on the number of largest eigenvalues that are considered meaningful. They are shown in Figs. 3 
and 4 below. The agreement between empirical data and the Marcenko-Pastur distribution is seen to be acceptable, 
although some systematic deviations are observed d, ^ . Should these deviations then be interpreted as the presence 
of true economic information hidden in the noise band, or are we missing an important effect? The phenomenology 
of financial markets suggests that a more faithful model consists in assuming that all individual stock returns are 
impacted by the same, time dependent scale factor at that represents the "market volatility": r* = 0-477*, where -ql 
are zero mean, unit variance Gaussian variables, and at is a random variable. From empirical studies, we know that 
the (Jt's have long range temporal correlations but for the purpose of the present study we only have to specify the 
marginal distribution P{a). One possible choice that matches quite well the data is to choose Eq. ([3]) with /i ~ 3 — 5. 
This corresponds to a multivariate Student distribution for the returns, discussed for example in [3^[l0j. This is the 
model we will consider more specifically below. Another possible choice, inspired by the multifractal random walk 
model [ll|, is a log-normal distribution for P{a). 



II. WISHART-STUDENT MATRICES 



A. Density of states of Pearson and Maximum Likelihood Estimators 

The first question we address is the generalisation of the Marcenko-Pastur spectrum for multivariate Student 
variables, when the Pearson estimate Eq. ([T]) is used to determine the empirical correlation matrix. The computation 
of the density of states (DOS) can be straightforwardly performed using free random matrices techniques [12l |. The 
trick is to use the so called Blue function which is the inverse of the resolvent G: B(G{z)) = z. The quantity 
B{x) — 1/a; is called the R-transform of G and under certain hypotheses, obeyed by any elliptic Wishart ensemble 
such that Gij = {rji-qj) = 5ij, is known to be additive [T^.fTsj 

Since any elementary matrix a^Tf^rfj is a projector, its resolvent is simply: 

where Q = T/N and cr^ = Jl/st (henceforth, Jl ~ /i/2 — 1), such that P{s) — s^/'^~^e~'^ /T{^/2) in the Student case. 
We have used that in the large N limit = 1- Inverting the resolvent at leading order we find: 

1 ^ 

Bt{x) = - + - (5) 

Using the additive properties of the R-transform we finally find the Blue function for E: 

where the second identity is due to the large T, N limit at fixed Q. 

The relation between the resolvent and the density of states is G(A — ie) — Gh(A) -|- i7rp(A), where Gr is the real 
part of the resolvent. Inverting this relation we find two coupled equations on Gr,p: 
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FIG. 1: Comparison between the Wishart Student distribution for Q 
the numerical result obtained for = 50 and 8000 samples. 



2 and = 6, solution of the previous equations, and 



Note that these equations are actually valid for any P{(t) (when tails are not too heavy). The last equation of course 
always admits the p = solution. At very small A, the solution of these equations is indeed p = 0. The corresponding 
Gr solves the equation 



A 



1 

Gr 



dsP{s) ^ 



s ~ ^iGr/Q 



(9) 



The RHS is well defined only for negative Gr, it goes to zero for very large and negative Gr, it goes to minus infinity 
at Gr = 0~ and it has a maximum somewhere in between. The maximum of the RHS corresponds to the largest 
value of A for which there is a real solution, i.e. to the left edge of the DOS. It can be determined obtaining the value 
of Gr where the RHS of the previous equation has a maximum: 



1 = / dsP{s) 



-TIGr/QY 



(10) 



and plugging this value into ([9]). When P{s) extends to s = 0, as is the case for Student variables, there is no real 
solution for larger values of A. This implies that the DOS has no right edge in that case. In order to determine the 
DOS right tails we focus on the large A limit of eqs (|7|8p . It is easy to check that Eq. ([7]) is solved in the large A 
limit by Gr ~ 1/A if p/Gr goes to zero. We will check that this is indeed the case after having determined p(A). In 
order to do so, we analyze Eq. ([8]) assuming p/Gr 0. In this case the integral can be computed exactly and one 
gets (for large A): 



P(A) 



p/2 



1 



r(p/2)QA'/2-i Ai+M/2- 



(11) 



This result abides our initial assumption p/Gr = p\ ^ Q when A — > oo. It can be interpreted in terms of rare 
events. Indeed the distribution of a has exactly the same power law tail. A very large a* on a given day, i, leads 
to an quasi-eigenvalue A ~ (y*'^IQ plus subleading contribution. As a consequence, writing (for s « 1,A >> 1) 
TP{s)ds = Np{X)dX allows one recover precisely the left tail of the DOS. As expected, provided p > 2, the power-tail 
disappears in the limit Q — > oo. 

The Wishart-Studcnt distribution for Q = 2 and p ~ 6, solution of the previous equations, is plotted in Fig. [T]and 
compared to a numerical result obtained for iV = 50 and 8000 samples. The agreement is excellent. 

We should however point out that the Pearson estimator is not, in the case of Student variables, the maximum 
likelihood estimator of C. The formula for this estimator was worked out in Q and is given instead by the solution 
of: 



E* 



N - 



T 



E 



.t 

7nn ' n 



(12) 
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FIG. 2: Comparison between the numerical solution of the self-consistent equation (|12p for A*' = 50, 80, 150 and Q = 2.5 (500 
samples) and the Marcenko-Pastur distribution. The agreement is excellent. Notice that the blurring of the MP right edge is 
a finite size effect, as shown by the evolution from = 50 to = 150. 

This reproduces the usual Pearson estimate when — > oo at fixed A'^. We are interested in the other limit N ^ oo 
at fixed ^. In this case /i can be dropped everywhere. As a consequence, the previous equation simplifies into: 



E* 



1 



t Z^mn In 



{E* ^)m7i11n 



(13) 



Furthermore, in the large N, T limit at fixed Q the denominator in the RHS is expected to become self-averaging and 
independent of t at leading order. Since the above equation only fixes E* up to an arbitrary multiplicative constant, 
we fix the value of the denominator to unity, which we can then verify self consistently. Since the 77* are Gaussian 
random variables with unit variance, the maximum likelihood estimator of C is a Wishart matrix, and the eigenvalue 
spectrum is again given by the Marcenko-Pastur distribution. In order to check that the denominator is indeed equal 
to unity, we break E* into two contributions: E^ + l/ {T)r]lrfj , where E^ is the part of the Wishart Matrix independent 
of the T]\. Expanding the expression for the denominator in powers of l/(T)7/*7/*, one finds at leading order in N: 



(Tr(£;*)-i)/iV 



1 



1 + Tr(i;*)-V(Q) 



(14) 



where we have used that in the large N,T hmit TyE* /N k. TtE^/N and E^('?*)V^ = 1- Rccalhng that the trace 
of the inverse of a Gaussian Wishart correlation matrix equals Q/{Q — 1) one can straightforwardly verify that the 
denominator is indeed equal to one. We have tested our result by numerical simulation by solving numerically the 
self-consistent equation for N = 50, 80, 150 and Q = 2.5. The density of states, averaged over 500 samples, is 
compared to the Marcenko-Pastur distribution in Fig. [2] The agreement is excellent, thus confirming our analytical 
result. 

This result is quite remarkable, especially from the point of view of cleaning noisy correlation matrices. Using the 
Maximum Likelihood Estimate of C one improves a lot the estimator; it allows to remove completely the effect of the 
noise due to a fluctuations, in particular to cut the noisy power-law tails of the Wishart-Student distribution. 

At this stage, it is interesting to discuss the generality of the above results. First, remark that the DOS could also 
be computed for an arbitrary correlation true matrix Cij = {TjiJij) using free random matrix theory and the so-called 
S-transform: since in this case E = VC'EwsVC, where Ews is a Wishart-Student empirical matrix considered above, 
the eigenvalues of E will be the same as those of the product CEws- The spectrum of E can be computed from the 
S-tranform of both C and ^ws, extending the classical result for Wishart matrices. Furthermore, our derivation of 
Eqs. (|7l8p does not require Gaussian r/iS but only that J^iivl)'^ = 1 for large N. Finally, although we focused on 
a particular shape P{<t), Eqs. ()7|8|) generalizes straightforwardly to any P((t) with not too heavy tails, for example a 
log-normal distribution. 
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FIG. 3: Comparison between the analytical results, Student (for /i = 3.85) and Gaussian Wishart, and the empirical data. 

B. Applications to financial data 

In the following we compare the Wishart-Student distribution to empirical data. We have considered the daily 
returns of 450 stocks of SP-500 from 2003 to 2007 and computed the empirical (Pearson) correlation matrices E for 
Q = 2.5. The resulting average density of states is compared to the Marcenko-Pastur DOS and the Wishart-Student 
DOS for /i = 4 and fi = 5. Note that we renormalized the empirical DOS by 1 — X^fc^" ^k/N where k runs over the 
indexes of the largest Km eigenvalues. As discussed in the introduction, this is done in order to subtract a clearly non 
random contribution. Although the first few eigenvalues are certainly non-random the precise choice of Km is a subtle 
matter. Different analysis suggest that Km « 10 Q. In the following we have chosen to determine Km directly from 
the Wishart-Student DOS: we determine the values of A0.5, A0.9 such that the probability that all sampled eigenvalues 
are less than these values is either 0.5 or 0.9 (of course assuming that the underlying distribution is Student). All 
eigenvalues that are larger than these cut-offs are assumed to be meaningful. A0.5, Ag.g depend on fi; furthermore the 
determination of Km can be altered by other effects not taken into account in our simple model. Therefore we have 
verified that our results are stable when Km. is between 2 and 10. In Fig. [3]and|4]we show the comparison between the 
analytical results, Student and Gaussian Wishart, and the empirical data. We have used 20 samples corresponding 
to a sliding average with step of 15 days. The two figures correspond respectively to the /i = 3.85 and /i = 5 case. In 
the first case we renormalized the empirical data considering the first or the first two eigenvalue as meaningful. In 
the second case we considered the first three or the first five as meaningful. 

The agreement with the Wishart-Student DOS is surprisingly good. The optimal value of appears to be close to 
/i = 3.85 in agreement with the value of fi obtained from the marginal distribution of daily returns {ji ~ 3.85, see 0). 
As stressed above these results are not affected much as as long as Km is between 2 and 10. 

In order to test directly the hypothesis that the returns are multivariate Student variables, we have repeated the 
same analysis with daily returns scaled by a proxy of the instantaneous volatility, namely: 

vl = I (15) 

and studied the eigenvalue spectrum of Eij = l/TX^t ^i^j: again subtracting the top eigenvalues. Note that we have 
normalized each return by tr* = X]t'5^t(^i )^/^ discussed in 

In theory, the resulting spectrum should now be well fitted by the Marcenko-Pastur distribution. As shown in Fig. 
[SJ the spectrum of E indeed moves closer to the Marcenko-Pastur result but is still distinctly different. However, the 
rescalcd returns 77* are still found have fat-tails; a possible explanation is that the volatility of a given stock fluctuates 
not only through a common (market) factor at but also through common sectorial volatilities, which could explain the 
deviation from Marcenko-Pastur. Another possibility is of course that there are still non-trivial eigenvectors within 
the "noise blob". We leave the detailed study of this question for further studies. 
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FIG. 5: Comparison between the standard DOS of Wishart correlation matrices obtained from empirical data (Empirical-10) 
and tlie one obtained renormalizing the returns by a proxy of the daily volatility (Emp-normalized-10). Both are rescaled using 
Km = 10 (Q = 2.5). MP and Student are the analytical Marcenko-Pastur and Student {/j, = 4) results. 

III. KULLBACK-LEIBLER ENTROPY 

The KuUback-Lcibler (KL) entropy allows one to measure the distance between two probability distributions and 
is defined in the following way [l^: 

5(2|1) = J dxP,ix)log(^^^ (16) 

It is easy to show that this entropy is semi-positive definite using that 5*21 = — (logP2/^'i)pi ^ — ^og{P2 / Pi) = 
— log(l)p2 ~ 0. Furthermore its minimum is reached for Pi = P2- As a consequence it is a possible measure how 
much the distribution P2 differs from Pi. Note however that it is an asymmetric measure: 5(112) 7^ S'(2|l). 

Tumminello et al. |16| computed the KL entropy for two multivariate Gaussian distributions with correlation 
matrices Ci_2 and found: 



5(C2;Ci) = -- [TrlogC2-iCi + Ty{C^'Ci) - Trl] 



(17) 



7 



From this expression, they estabHshed that for iV-multivariate Gaussian variables (stock returns) with true correlation 
matrix C, the following holds: 

• The average of ^(E; C) is independent of C for all N where E is the Pearson estimator of C on a T long time 
series: (5(E; C)) = Z{N,T) 

• The average of S'(Ei;E2) is also independent of the true correlation C, where Ei,E2 are the empiri- 
cal correlations corresponding to two independent realisations of the multivariate process described by C: 
{S{B^■,E2)) = Z'{N,T) 

This is a very interesting remark because in cases where the a priori distribution is indeed Gaussian one can judge 
the relative performance of different cleaning procedures of E without knowing the true correlation matrix C, by 
computing S'(E;Ec;) where E^; is the cleaned empirical correlation matrix. The above two values Z,Z' provide 
interesting benchmark values for S. The best one can do is to recover the true correlation matrix: Ec/ = C, so 
5(E; Ec/) < Z means that some noise remains in the cleaned matrix, while 5(E; E^/) > Z means that the cleaning 
is too violent and introduces some distortion. On the other hand, the most trivial cleaning procedure is doing 
nothing: Ecj = E. Therefore, considering two independent realizations, a good cleaning procedure should be such 
that S'(Ei;E2,ci) < Z'. Furthermore, Z' is also interesting to estimate the reproducibility of the filtering procedure 
by comparing it to S{'Ei.ci;'Ei2.ci): see the discussion in [la]. We also remark also that the value of the KL entropies 
cited above (divided by N) is self-averaging in the limit of a very large number of stocks N. 

As discussed above, the distribution of stock returns is not Gaussian but rather multivariate Student with an 
exponent /i « 4 — 5. [Note that the marginal distribution of single stock returns is then also found to be a Student-t 
distribution]. In order to apply the above ideas to real data, we need to extend the results of Tumminello et al. 
[Toj to multivariate elliptic distributions, parameterized by an arbitrary distribution P{s) of the inverse variance. As 
explained above, the Gaussian case corresponds to P{s) = 5{s — Jl) and the Student case corresponds to P{s) = 
5M/2-ie-7r(///2). 



A. Kullback-Leibler entropy for elliptic laws 

In order to compute 5(2] 1) for generic elliptic laws one has to compute: 

5i2 = J dxPi(x)log(F2(x)), (18) 

where x is an A^-dimensional vector. The KuUback Leibler entropy will then be obtained as — S'12. Therefore all 
constant terms, i.e. independent of the correlation matrix C, cancel between the two contributions. 
A general expression for S'12 can be worked out using replicas: 

Sri = lim dn I dxi...dxNPi{P2T- (19) 

n^O J 

For any positive integer n one can plug into the above equation the general expression of multivariate elliptic laws 
and find the following expression for £'12: 



lim dn / dxi...dxNdsdsi...ds^P{s)s'"^ W P{sa)s^'^ ^ . '.^ ^ (20) 

J Li + VdetCidet(C2)" 

One can now integrate out the x variables and get: 

5i2 - lim 5„ f dsds,...ds,,P{s) f] P{sa)s'J/'' ( ^ ] . ^ (21) 

J . \ ^/2^ , /detect / . /^^fn ^ n-in, = i .^ 



a=i VV2^ \/det(C2)/ ^dci[l + C^^CiY.^Sals 

Introducing the following identity in the previous expression 



(22) 
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one can finally make the analytic continuation to real n: 



Si2 = lim 9„ / dsdydyP{s) \ ^ } \ - ) ^ = (23) 



^ ^det(C2) / y'det[l + C2"iCiy/ 

It is now possible to differentiate with respect to n, take the limit rt ^ and get the general expression: 

1 f / r dsP(s')s^/'^e^^y\ Q-wv 

Si2 = --TVlogCa + / dsdydyP{s)\og ^ } , ^= + K (24) 

^ J VV2^ Vdct(C2) / y'det[l + C^^Cizj/s] 

where if is a constant independent of the Cs. The final expression for the KuUback-Leibler entropy is therefore: 

-iyy 



S{2\1) = --Trlog[C2-iCi]- dsdydyP{s)\og[ dsP{s 



,N/2isy 



det[l + C-^Ciy/s] 

dsdydyPi^) log dsP(,s)s^/2e-S j + y/s)^/^ ^^^^ 

An important remark that will be very useful below is that this expression can be written in general as >5'(2|1) = 
Tr/(C^^Ci), where / is a function that depends on P(s). In the following we will apply the general expression above 
to the Gaussian case, to check its validity, and to the Student case. 

1. Gaussian distribution 

Let us focus on the multivariate case when P(s) = 5{s —jt). In this case the second term in the general expression 
above simplifies considerably. Up to a constant term that cancels out between the second and the third term and 
after rescaling y,y ^ yjl, y/jl we find: 

i / dydy (26) 
/det[l + C^iCi2/] 



Integrating over y one gets: 

1 



det[l + C^^Ciy] 2 " l + Ca^Ci? 



v=o 



^nC^'C,] (27) 



Repeating the same procedure for the third term, we finally recover the expression derived in [l6|: 

^(2|1) = i [-TrlogC2-iCi +Tr(C2-iCi) -Trl] , (28) 
which can indeed be obtained in a much simpler way. 

2. Student distribution 
In order to proceed we have first to compute: 

which in the case of a Student distribution P{s) = s^^/'^^^e^^ /T{^/2) reads: 



r(M/2) 



(30) 
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We will use this identity to simplify the second term of ([25]) finding: 



dsdydyP{s) log (1 - iy) ^ + K' (31) 

det[l + C2"^Ciy/s] 



In order to be able to integrate over y we apply the identity log z ~ (e * — e ^*)/t to z = 1 — iy. Hence, we obtain 

^ + ^ dsdtPis) ( ^^Pt!) _ ^"P(-^) I (32) 



The double integral is dominated by small values oit/s such that the leading contribution can be obtained expanding 
to the first order in t/s. This can be readily checked when C2^^Ci = 1. Thus, in the large limit, one finds: 

N + , I ^^^^^^^^ ^exp(-t)-exp(-t-mC,-^C,/(2.))^ ^33^ 

Using again the integral expression of the logarithm we find that the second term of (j25p reads: 

^^-i^ J dsP(s) log (l + TrC-iCi/2s) (34) 

Putting all pieces together we find finally the expression for the KuUback-Leibler entropy for multivariate student 
distributions: 

Sim = 4T,log[C-C.| + ^ /..PWlog (iif^^gl^) (35) 

This expression allows one to recover the Gaussian case above in the limit n/N oo, where s~/i/2^7V. As a 
consequence one can expand the logarithm and find the previous expression for Gaussian distribution. The case of 
interest here is instead fJ-/N ^ 1. In this case the previous expression simplifies into: 

5(2|1) = -iTrlog[C2-iCi] + ^ log {TtC^'CJN) (36) 

One can change continuously from the Gaussian case to the Student case by tuning the parameter x = N/ fj, in the large 
N,fi limit. Gaussian and Student correspond respectively to a; = 0, oo. Note that the final expression is independent 
of II in the x = oo case, at least its leading contribution in N which is what we are interested in. 



B. Applications 

Following [l^ we shall now compute Z/N and Z' /N, i.e. Skl{Ci, C2) when Ci = Ei is the empirical correlation 
matrix generated from the a priori correlation matrix C and C2 is either equal to C or is another independent 
empirical correlation matrix E2. The crucial results of [l6j is that these expectation values are independent of the 
true correlation matrix C. This is in fact true in the case of a general multivariate elliptic distribution since the 
final expression can be written as Tr/(C^^Ci). A generic empirical correlation matrix E can indeed be written as: 
E = \/CEo\/C where Eq is a Wishart correlation matrix of independent random variables. As a consequence the 
contribution of C cancels out from traces of powers of E^^Ci and one gets, for example: 

S{2\1) = Tr/(EriC) = Tr/(E„-i) (37) 

i.e. as found in [l6l |. the KuUback-Leibler entropy does not depend on the a priori correlation matrix C. However, 
its value depends on P{s). In order to compare to real data we will compute explicitely these entropies for Gaussian 
and Student distributions. Both are expected to be self-averaging quantities in the large N limit. 
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1. Gaussian 

Calling pmpW the Marcenko-Pastur density of states of the empirical correlation matrices (with C = 1) we find, 
from the general expression above: 



^i§^ = l /dAp(A)[-logA + l-A] 



(38) 



In order to compute S'(Ei;E2) one has to calculate TrEj^ ^'E2/N. This can be performed noticing that the distri- 
bution law for these matrices is invariant under arbitrary independent rotations. Therefore we find 

(TrEr^E2/7V) = ^l(«l&)l'Ab) = 3^ At) = J dXpMpiX)X J dXpMp{X)X-' (39) 

Hence, the final result is: 

= + ^ / dXpMPiX)X I dXpMPi\)\ (40) 
It can be shown that these expressions coincide with the ones of [3 in the large N limit. 



2. Student 



Calling now ps(A) the density of states of the Wishart-Student matrices computed in section II above, we find 

5(E;C) 



TV 



i / dAps(A)logA + ilog( / dXpsiX)X 



Applying the same argument used in the Gaussian case, one also finds: 

^1^^ = \\og(j dXps{X)X\ + i log { I dXps{X)/X 



(41) 



(42) 



The numerical values of these entropies, computed for different Q ~ T/N and ^'s, are given in the Tables below. 



/i = 3 


g = 1.5 


= 2 


= 3 


= 5 


Z/N 


0.645126 


0.527893 


0.409243 


0.303255 


Z'/N 


0.990942 


0.730459 


0.519955 


0.361961 



/i = 4 


0= 1.5 


= 2 


= 3 


= 5 


Z/N 


0.445103 


0.336914 


0.233323 


0.149822 


Z'/N 


0.814573 


0.568792 


0.376484 


0.23867 




fi = 5 


0= 1.5 


= 2 


= 3 


= 5 


Z/N 


0.361844 


0.263336 


0.172947 


0.10532 


Z'/N 


0.739387 


0.502362 


0.320584 


0.193947 



A comparison of these values to financial data, along the same line as [l^, is left for a future work. Note that 
since the Maximum Likelihood Estimate of the a priori correlation matrix is a Wishart-Gaussian matrix in the large 
N limit (see section II), the values of Z^Z' found by [lB| in the case of Wishart-Gaussian matrices turn out to be 
correct for the Maximum Likelihood Estimate of Student correlation matrices, as very recently found numerically by 
Tumminello et al. [l3l- 
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IV. CONCLUSION 



111 this work, we have studied in some details a new ensemble of random correlation matrices related to multivariate 
Student (or more generally elliptic) random variables. We have found the exact density of states for the Pearson 
estimate of the correlation matrix for uncorrelated variables, that generalizes the Marcenko-Pastur result. It would be 
interesting to know whether the joint distribution of eigenvalues can also be computed exactly in this case. We have 
shown that for the Maximum Likelihood estimator, the density of states is still exactly given by the Marcenko-Pastur 
distribution. 

The comparison between the theoretical density of states in the Student case and empirical financial data is surpris- 
ingly good, in any case much better than the Marcenko-Pastur result. However, we arc still able to detect significant 
systematic deviations, which suggest the need of a richer, non-elliptic model for the joint distribution of returns, or 
the presence of information carrying, low eigenvalues of the correlation matrix (or both). 

Finally, we have computed cxplicitely the KuUback-Leibler entropies of empirical Student matrices, which are found 
to be independent of the true correlation matrix, as in the Gaussian case. Using our result on the density of states, 
we give the exact numerical value of the KuUback-Leibler entropies in various cases of interest. 



We thank Fabrizio Lillo for very useful discussions and for sending ref. [l7l | prior to publications, and the organizers 
of the second Cracow meeting on Random Matrices for providing us with the opportunity to put this work together. 
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